The RAND Corporation
1700 MAIN ST • SANTA MONICA • CALIFORNIA

P-1165

7-30-57


PREFACE 


This is a draft of a report which is being circulated for information
and comment. We hope to make it a chapter of a book titled Military
Planning In An Uncertain World, and would appreciate any comments,
criticism, ideas, and examples that readers may have. This draft began as a
transcript of an informal talk and, despite some rewriting, it probably
still suffers (like many such talks) from being "fashionable." We are aware
that it has a number of other weaknesses and assume there are still others
of which we are not aware. We hope to give it a thoughtful and leisurely
review but are deferring this until we get some outside criticism.

A table of contents is given on the next page to show the relation of
this chapter to the rest of the book. The chapter may not be quite
self-contained as a paper, as it occasionally refers to other chapters; but we
trust this will be understood or overlooked.

A more complete introduction and list of acknowledgements are given
in RM-1829-1.



MILITARY PLANNING IN AN UNCERTAIN WORLD

CONTENTS OF BOOK

I. Techniques of Systems Analysis

1. Designing the Offense

2. Probabilistic Considerations

3. Designing the Defense

4. The Two-Sided War

5. Evaluation and Criticism

II. Techniques of Operations Research

6. Flyaway Kits: An Application of and Introduction to
Chapters 7 and 8

7. Elementary Economics and Programming

8. Probability and Statistics

9. Monte Carlo

10. Game Theory

11. War Gaming

III. Philosophical and Methodological Comments

12. Ten Common Pitfalls

13. Nine Helpful Hints

14. Miscellaneous Comments




INTRODUCTION*

A problem doesn't have to be very complicated before it gets too
difficult for even a modern high-speed computer to do in a
straightforward fashion. One of the powerful techniques available that will often
make a seemingly intractable problem tractable, if not easy, and one that
is particularly well suited to the electronic computer, is the so-called
Monte Carlo method. This fashionable name has been given to any technique
which uses sampling to estimate the answer to a precise mathematical
problem. A game of chance is devised with the property that the average of
the scores of a large number of plays of the game is the number being
estimated. While the game may be played by using gambling devices such as
a roulette wheel, dice, or coins, usually the simplest and most practical
such device from the viewpoint of the computer is a table of random numbers.
Such a table presents a strange appearance to the layman. It is nothing but
a haphazard collection of the digits 0 to 9, but extreme care is taken to
see that the collection is uniformly haphazard and random. By using this
table it is possible to simulate the play of any mechanical gambling device.

The game of chance can be a direct analogue of the problem being
studied or it can be an artificial invention. The only property that it
must possess is that its average score is the answer to the desired problem.


* This paper was largely drawn from another paper written by one of us,
"Use of Different Monte Carlo Sampling Techniques", published by Wiley and
Sons Co., "Symposium on Monte Carlo Methods" 1956. The reader is referred
to that paper for a more technical discussion. We are indebted to Wiley
and Sons Co. for allowing us to do this.



It is not necessary to use the first game that comes to mind since it is
often possible to render this estimation more efficient by making changes
in the game that do not disturb its expected score. The new game may be
more efficient because it is less erratic or perhaps because it is cheaper
to play.

Therefore, when using the Monte Carlo technique to solve a problem,
one directs attention to three main topics:

(1) choosing the probability process (picking the game of chance)

(2) generating sample values of the random variables on a given
computing machine (playing the game)

(3) designing and using variance reducing techniques (modifying
the game to be more efficient).

The first two of these topics were discussed in the example on the attrition
of bombers in Chapter 2, pages 46-50. The reader is referred to these pages
if he wishes an elementary discussion of a typical Monte Carlo problem.

This chapter will discuss only the last topic, how to increase the
efficiency of a Monte Carlo calculation by proper experimental design. It
is, of course, true that in actual problems one cannot isolate variance
reduction from the first two topics. The methods that can be used to reduce
variance are often sharply dependent upon the probability model and in some
cases on the techniques used to generate values of the random variables.
Also, the greatest gains in variance reduction are often made by exploiting
specific details of the problem, rather than by routine application of
general principles. However, there do seem to be some general ideas on
reducing variance which can be used in many problems. Seven techniques
seem to be most useful. They are:




1. Importance Sampling

2. Russian Roulette and Splitting

3. Use of Expected Values (combination of analytic and probabilistic
methods)

4. Correlation and Regression

5. Systematic Sampling

6. Stratified Sampling

7. Specialized Techniques

While all of these techniques can be used in standard statistical
sampling problems, the first three seem to have found particular and
specialized usefulness in Monte Carlo applications as differentiated from
the usual applications in ordinary sampling. This is mainly because of the
fact that in a Monte Carlo problem the experimenter has complete control of
his sampling procedure. If, for example, he were to want a green-eyed pig
with curly hair and six toes and if this event had a non-zero probability,
then the Monte Carlo experimenter, unlike the agriculturist, could
immediately produce the animal.

In order to illustrate the general nature of the techniques, we will
apply them to a very simple example; so simple, in fact, that the reader
will have to exercise his imagination in order to see that there is a problem.

Consider the problem of calculating the probability of obtaining a total
of three when two ordinary dice are tossed. Each die is of the standard sort
with six faces labeled from one to six and constructed so that each face has
the same probability (1/6) of being on top. This problem can, of course, be
solved analytically. Any particular combination of the dice has a
probability equal to 1/6 times 1/6 of occurring. Since there are two combinations
which make three (one-two and two-one), the probability of getting a three
in a random toss of the dice is 2/36 or 1/18.

In doing the problem by Monte Carlo one could simply toss the dice¹ N
times, count the number (n) of successes (threes) and then estimate the
probability (p) of success by


$\hat{p} = \frac{n}{N}$    (1)

Typically, $\hat{p}$ differs from p; that is, the estimate has a statistical
error. This statistical error is usually measured by the standard deviation
σ.² In this case,

$\sigma = \sqrt{\frac{p(1-p)}{N}}$    (2)

The percent standard deviation or standard error is then given by

$\frac{100\,\sigma}{p} = 100\sqrt{\frac{1-p}{Np}}$    (3)
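Equations (1) and (3) can be tried out directly. The Python sketch below is
not part of the original text: the function names are made up for
illustration, and Python's pseudo-random generator is assumed as a stand-in
for a table of random numbers.

```python
import math
import random


def crude_estimate(n_trials, seed=0):
    """Equation (1): toss two fair dice n_trials times and return n / N."""
    rng = random.Random(seed)  # assumed stand-in for a table of random numbers
    successes = 0
    for _ in range(n_trials):
        if rng.randint(1, 6) + rng.randint(1, 6) == 3:
            successes += 1
    return successes / n_trials


def percent_error(p, n_trials):
    """Equation (3): percent standard deviation of the crude estimate."""
    return 100.0 * math.sqrt((1.0 - p) / (n_trials * p))


if __name__ == "__main__":
    for n_trials in (100, 10_000):
        print(n_trials, crude_estimate(n_trials), percent_error(1 / 18, n_trials))
```

With p = 1/18, formula (3) gives about 41 percent for N = 100 and about 4.1
percent for N = 10,000, so a hundredfold increase in N buys only a tenfold
reduction in error.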

As is intuitively clear, and as is shown by formula (3), this error
goes down (though not very rapidly) as the number of trials is increased.
In what follows, we will illustrate other ways than increasing N by which
the error can be decreased. These are very important in practice, for to
diminish a standard deviation tenfold by simply increasing N requires that
N be increased one hundredfold. We can often get improvements of the same
order by very simple changes in the sampling procedure.

¹ Usually one would not toss physical dice, but simulate the tosses with
the aid of a table of random numbers.

² In Monte Carlo problems the error has statistical properties which can
usually be described in the following manner. The probability that the
absolute value of the error will be larger than mσ is given by the following
table:

    m       Prob.
    0.67    .50
    1.00    .32
    2.00    .05
    3.00    .003
    4.00    .0001

σ is called the standard deviation and in our case is given by equation (2)
above. A more complete explanation of statistical errors can be found in
almost any elementary book on statistics.


1.  IMPORTANCE  SAMPLING 




If by some method we can increase the effective value of p, equation
(3) shows that even though the same N is used, the percent error will be
reduced. This increase in the effective value of p can be obtained very
easily. We could, for example, bias the dice so that the probability for
either a one or a two would be twice as great as usual, that is 1/3 rather
than 1/6. This could be done with physical dice by "loading" them, or with
mathematically simulated dice by using a biased table of random numbers.

If a one and two each had a probability 1/3 of occurring, then the
probability of getting a three, instead of being 1/18, would be four times as
great or 2/9. The percent error is then cut by slightly more than a factor
of two. Of course, equation (1) can no longer be used to estimate p but

$\hat{p} = \frac{1}{4}\,\frac{n}{N}$    (4)

must be used instead. The 1/4 in equation (4) is called a weighting factor.
By using it, the distortion introduced by the biased sampling is removed.

It is vital to appreciate that even if p were difficult to compute (as we
are in effect pretending) the weighting factor is still easy to compute.
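The loaded-dice estimate of equation (4) can be sketched in a few lines of
Python. This is an illustration only: the function names are hypothetical,
and the text does not say how the remaining probability of 1/3 is shared
among faces three to six, so an equal share of 1/12 each is assumed.

```python
import random

FACES = [1, 2, 3, 4, 5, 6]
# One and two each come up with probability 1/3; the other four faces are
# assumed (not stated in the text) to share the remaining 1/3 equally.
LOADED = [1 / 3, 1 / 3, 1 / 12, 1 / 12, 1 / 12, 1 / 12]


def loaded_die(rng):
    """Toss one die loaded in favor of one and two."""
    return rng.choices(FACES, weights=LOADED)[0]


def importance_estimate(n_trials, seed=0):
    """Equation (4): weighted fraction of threes thrown with loaded dice."""
    rng = random.Random(seed)
    successes = 0
    for _ in range(n_trials):
        if loaded_die(rng) + loaded_die(rng) == 3:
            successes += 1
    # True probability of a success combination over its loaded probability:
    # (1/6 * 1/6) / (1/3 * 1/3) = 1/4, the weighting factor of equation (4).
    weight = (1 / 6 * 1 / 6) / (1 / 3 * 1 / 3)
    return weight * successes / n_trials
```

Under these assumptions the percent error of formula (3) is proportional to
sqrt(17) for the fair dice (p = 1/18) and sqrt(3.5) for the loaded dice
(effective p = 2/9), a ratio of about 2.2.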

This illustrates the general idea of Importance Sampling, which is to
draw samples from a distribution other than the one suggested by the problem
and then to carry along an appropriate weighting factor, which, when
multiplied into the final results, corrects for having used the wrong
distribution. The biasing is done in such a way that the probability of the sample's
being drawn from an "interesting"³ region is increased; the probability that
it comes from an "uninteresting" region is correspondingly decreased. The
reader should verify for himself that it would be good to carry the bias
illustrated in the example to its natural limit; the probability of getting
a one or a two could be increased by a factor of three, making the
probability of obtaining one of these numbers 1/2 and making the probability of
obtaining any other number zero.

³ The words "interesting" and "uninteresting" refer to the amount of
effort or interest the sensible calculator would show in the region.

This natural limit is not the ultimate limit. For example, if we tossed
the dice one at a time, then we might want to bias the second die
differently from the first one. In particular, if we were willing to let the biasing
of the second die depend on the outcome of the first throw, we might consider
the following scheme.

1. Increase the probability of getting a one or a two on the
first die by a factor of three. This means, of course,
that there will be a zero probability of getting any other
numbers.

2. If the first die comes up one, increase the probability of
the second die coming up two by a factor of six; if the
first die comes up two, increase the probability of the
second die coming up one by a factor of six.

If this scheme is followed every toss of the dice will yield a three so that
the number of successes (n) will be equal to the number of trials (N). The
weighting factor will be 1/3 times 1/6 or 1/18 and the estimate will be

$\hat{p} = \frac{1}{18}\,\frac{n}{N} = \frac{1}{18}$    (5)

which is exactly equal to p. We have devised a sampling procedure which has
zero variance. It is always possible to design an Importance Sampling
scheme that has zero variance, but the computation needed to compute what
the proper biasing procedure may be is exactly that needed to solve the
original problem and in some cases even more.
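The two-step scheme can be written out as a short Python sketch. The
function name and the use of exact fractions are choices made here for
illustration; they are not from the original text.

```python
import random
from fractions import Fraction


def zero_variance_play(rng):
    """One play of the fully biased game; returns the weighted score."""
    # Step 1: the first die shows one or two with probability 1/2 each,
    # three times the true probability of 1/6.
    first = rng.choice([1, 2])
    # Step 2: the second die is forced to complete the total of three,
    # probability 1 instead of the true 1/6 (a factor of six).
    second = 3 - first
    # Weight = (true / biased) for each die = (1/6)/(1/2) * (1/6)/1 = 1/18.
    weight = (Fraction(1, 6) / Fraction(1, 2)) * (Fraction(1, 6) / Fraction(1, 1))
    success = 1 if first + second == 3 else 0
    return success * weight
```

Every play returns the same score, 1/18, which is what zero variance means
here. Note that writing down the forced second die already required knowing
which combinations make three, that is, solving the problem.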

Examples⁴

One of the major problems on which Monte Carlo is used is to calculate
the probability that nuclear particles will penetrate shields. In such a
problem, the particle starts at one side of the shield, and has collisions
of different types with the atoms of the shield, finally being either
reflected backwards, absorbed in the shield, or transmitted. The calculation
can be done by Monte Carlo by simulating the particle histories with the
aid of random numbers. As this idea of Importance Sampling suggests, the
simulation should not be faithful. For example, the following types of
random events (typically) increase the probability of penetration and
should be emphasized at the expense of equally, or even more probable, but
less "important," ones:

a. collisions resulting in a forward direction of motion

b. collisions that result in small energy losses

c. long forward jumps and short backward jumps (the so-called
exponential transformation)

d. survival vs. absorption (if carried to the limit, this can be
looked on as an application of Use of Expected Values).


⁴ Some of the examples discussed at the end of each section may be
hard for the non-professional to follow. He will find that if he skips
them, the later text starts again from the beginning.



The calculator is, of course, confronted with the problem of how far to go
in altering the sampling. In deciding this he must use a combination of
judgment and calculation which cannot be entered into here.⁵

Another important application of Monte Carlo is in the design and
analysis of reactors. Here again we are studying the various ways in which
nuclear particles, particularly neutrons, behave in matter. It is found
that those which wander away from the center of the reactor will not
contribute much of interest to the process. On the contrary, those neutrons
which wander back toward the fissionable material are the ones which
contribute most to the answer. The sampling must then be designed to sample
more frequently among the second type of neutrons and less frequently among
the first type of neutrons.

Monte Carlo is also applied to Operations Research problems. In a
typical problem of this type we might try to calculate the vulnerability of
a piece of equipment or of an airfield to some offensive weapon. In such
cases, one often Monte Carlos, for example, the error of the missile which
is doing the destruction. The distribution of errors is determined by a
parameter called the Circular Probable Error (CEP). The CEP is the radius
of the circle about the aiming point into which the missile will fall 50%
of the time. Since we are interested here in destruction, if we Monte Carlo
from a distribution defined from a smaller CEP than that which obtains in
the real world, we will find that more of the interesting processes (hits)
happen in the simulated experiment than would happen naturally. The less
interesting processes (misses) are then discriminated against.
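A minimal Python sketch of sampling from a smaller CEP follows. Several
things here are assumptions not found in the text: the miss distance is
taken to follow a circular normal distribution, a "hit" is defined as
landing within a hypothetical lethal radius of the aiming point, and all
function and parameter names are invented for illustration.

```python
import math
import random


def sigma_from_cep(cep):
    """Per-axis standard deviation of a circular normal with the given CEP."""
    return cep / math.sqrt(2.0 * math.log(2.0))


def exact_hit_probability(radius, cep):
    """Probability of landing within `radius` of the aiming point."""
    return 1.0 - 2.0 ** (-((radius / cep) ** 2))


def miss_distance(sigma, rng):
    """Draw a radial miss distance (Rayleigh distributed)."""
    u = 1.0 - rng.random()  # lies in (0, 1], so the logarithm is finite
    return sigma * math.sqrt(-2.0 * math.log(u))


def biased_hit_estimate(radius, cep, biased_cep, n_trials, seed=0):
    """Estimate the hit probability by sampling from a smaller CEP."""
    rng = random.Random(seed)
    s_true = sigma_from_cep(cep)
    s_bias = sigma_from_cep(biased_cep)
    total = 0.0
    for _ in range(n_trials):
        r = miss_distance(s_bias, rng)
        if r <= radius:
            # Weighting factor: true density over biased density at r.
            total += (s_bias / s_true) ** 2 * math.exp(
                -r * r / (2.0 * s_true ** 2) + r * r / (2.0 * s_bias ** 2)
            )
    return total / n_trials
```

For this simple geometry the answer is known in closed form, which makes it
a convenient check on the weighting; in a real vulnerability study the hit
criterion would be too complicated for that.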


⁵ See previous reference or RAND report RM-1237 "Applications of Monte
Carlo."



For  another  example  in  the  same  field,  consider  queuing  problems.  In 
these  problems  one  is  often  interested  in  the  mean  and  variance  of  the 
waiting  time.  One  then  wishes  to  bias  the  sampling  to  emphasize  long  waits. 
This  could  be  done  by  sampling  from  new  distributions  that  simulate  increased 
traffic,  increased  servicing  time,  or  increased  servicing  requirements. 

It is worth noting that any set of samples obtained with the use of
Importance Sampling is less effective in estimating certain auxiliary
quantities than a set that has been obtained in a straightforward fashion. For
instance in the shielding problem, a sampling design that leads to an accurate
estimate of the probability of penetration will be very poor for estimating
the probability of reflection; in the reactor problem the suggested sampling
would not be good at estimating leakage; in the vulnerability problem we will
lose information about light damage and the location of misses; and finally
in the queuing problem we will not get a good estimate of the idle time of
the servicing facilities.

It is in fact usually (but not always) true that to design an efficient
Monte Carlo calculation one must direct attention to those things he is really
interested in and ignore other aspects of the problem. It may even be better
to do more than one calculation than to compromise the goals of any particular
design. This can be a serious disadvantage if the computation is lengthy, as
the decision about what aspects to concentrate on must be made early and
therefore may easily be made wrongly.




2. RUSSIAN ROULETTE AND SPLITTING


Let us assume that the dice are tossed one at a time and that the cost
of the problem is measured by the total number of individual tosses. Now
it is immediately clear that if the first die is tossed and if it happens
to come up three or greater, it will be impossible to get a total of three,
no matter how the second die comes up. Under these circumstances, there is
no point in making the second toss and we can simply record a zero for the
experiment. This makes it unnecessary to toss the second die 2/3 of the
time. Therefore on the average we will do 1/3 fewer tosses in an experiment.
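The saving in tosses can be checked with a short Python sketch. The function
names are hypothetical and a pseudo-random generator is assumed in place of
dice or a random number table.

```python
import random


def toss_with_early_stop(rng):
    """One experiment; returns (score, number of individual tosses used)."""
    first = rng.randint(1, 6)
    if first >= 3:
        return 0, 1  # a total of three is impossible: record a zero and stop
    second = rng.randint(1, 6)
    return (1 if first + second == 3 else 0), 2


def run(n_experiments, seed=0):
    """Return (estimate of p, average tosses per experiment)."""
    rng = random.Random(seed)
    hits = 0
    tosses = 0
    for _ in range(n_experiments):
        score, used = toss_with_early_stop(rng)
        hits += score
        tosses += used
    return hits / n_experiments, tosses / n_experiments
```

The expected cost is 1 + 1/3 = 4/3 tosses per experiment instead of 2, which
is the one-third saving described above, and the estimate itself is unchanged.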

In more complicated examples where the sampling is done in stages, it
is often possible to examine the sample at each stage and classify it as being
in some sense "interesting" or "uninteresting." The sensible calculator is
willing to spend more than an average amount of work on the "interesting"
ones and less on the "uninteresting" ones. This can be done by splitting the
"interesting" samples into independent branches, thus getting more of them,
and by killing off some percent (in the above example 100%) of the
"uninteresting" ones. The first process is Splitting and the second Russian Roulette.

The "killing off" is usually done by a supplementary game of chance. If
the supplementary game is lost the sample is killed; if it is won the sample
is counted with an extra weight to make up for the fact that it ran some risk
of being killed. The game has a similarity to the Russian game of
chance played with revolvers and foreheads, whence the name.


The idea of Russian Roulette and Splitting is similar in spirit to the
sequential sampling schemes of quality control, though quite different in
detail. It was first thought of in connection with particle diffusion
problems. Particles that get into interesting regions are split into n
independent sub-particles, each with one n'th of the weight of the original
particle. Particles that get into uninteresting regions are, in effect,
amalgamated into a smaller number of heavier particles. In this way the
calculator achieves his goal of allocating his effort sensibly.
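Both devices can be sketched for the two-dice problem in Python. The names
`roulette`, `one_history`, `n_split` and `survive_prob` are hypothetical,
and the choice of three branches is arbitrary; the default survival
probability of zero corresponds to the 100% kill of the dice example.

```python
import random


def roulette(weight, survive_prob, rng):
    """Supplementary game: kill the sample with probability 1 - survive_prob.

    A survivor carries weight / survive_prob to make up for the risk it ran,
    so the expected weight is unchanged.
    """
    if rng.random() < survive_prob:
        return weight / survive_prob
    return 0.0  # killed


def one_history(rng, n_split=3, survive_prob=0.0):
    """Return (score, tosses) for one starting sample."""
    first = rng.randint(1, 6)
    tosses = 1
    score = 0.0
    if first <= 2:
        # Interesting: split into n_split branches, each of weight 1/n_split.
        for _ in range(n_split):
            second = rng.randint(1, 6)
            tosses += 1
            if first + second == 3:
                score += 1.0 / n_split
    else:
        # Uninteresting: play Russian Roulette before spending another toss.
        weight = roulette(1.0, survive_prob, rng)
        if weight > 0.0:
            second = rng.randint(1, 6)
            tosses += 1
            if first + second == 3:  # cannot happen here; survivors score 0
                score += weight
    return score, tosses


def estimate(n_histories, seed=0, n_split=3, survive_prob=0.0):
    """Return (estimate of p, average tosses per history)."""
    rng = random.Random(seed)
    total = 0.0
    tosses = 0
    for _ in range(n_histories):
        score, used = one_history(rng, n_split, survive_prob)
        total += score
        tosses += used
    return total / n_histories, tosses / n_histories
```

With three branches and a 100% kill the average cost is again two tosses per
history, but all second tosses are now spent on samples that can still
produce a three.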

Most of the examples mentioned in Importance Sampling also could be
used to illustrate the use of Russian Roulette and Splitting. However, there
are differences. In Importance Sampling the samples are forced into the
"interesting" regions and prevented from entering "uninteresting" regions.
Sometimes, though, this is hard to arrange, particularly when the sampling
distribution is not given explicitly, but a complicated process for getting
sample values is given instead. In these circumstances we may have very good
estimates of the relative "importance" of different regions and still not be
able to arrange for the proper biasing. We can then use Russian Roulette and
Splitting which, in effect, does the same thing post hoc; that is, we wait to
see what region is entered and then decide what the size of the sample
should be.


3. USE OF EXPECTED VALUES




If the sampling is being done in two stages, then even if we aren't
clever enough to calculate the combinatorics of the whole problem, we still
might be clever enough to notice that there is no point in tossing the
second die; that is, once the first die is tossed, it is trivially easy to
calculate the probability of obtaining a total of three. For example, when
the first die comes up one, the only way we can get the three total is for
the second die to come up two. This event obviously has a probability of 1/6.
Similarly if the first die comes up two, the only way to get three is for the
second die to be one. This event also has a probability of 1/6. Finally,
all the other possibilities for the first die (three to six) have a zero
probability of giving three. If we record the probabilities rather than
toss the second die, then it is a fact that the average of these
probabilities is an estimate of p.⁷ This method simultaneously reduces the number of
tosses we need by a factor of two and decreases the variance, so that the
tosses we do make are more effective.
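A short Python sketch can compare recording the conditional probability with
tossing both dice. The function names are hypothetical and the comparison is
an illustration, not a calculation taken from the text.

```python
import random
import statistics


def expected_value_score(rng):
    """Toss only the first die and record the chance of completing a three."""
    first = rng.randint(1, 6)
    return 1 / 6 if first <= 2 else 0.0


def crude_score(rng):
    """Toss both dice and record one for a three, zero otherwise."""
    return 1.0 if rng.randint(1, 6) + rng.randint(1, 6) == 3 else 0.0


def compare(n_samples, seed=0):
    """Return (mean, variance) per sample for each of the two methods."""
    rng = random.Random(seed)
    ev = [expected_value_score(rng) for _ in range(n_samples)]
    crude = [crude_score(rng) for _ in range(n_samples)]
    return (statistics.fmean(ev), statistics.pvariance(ev),
            statistics.fmean(crude), statistics.pvariance(crude))
```

Worked out by hand, the variance per sample is 1/162 for the recorded
probabilities against 17/324 for the crude scores, a reduction by a factor
of 8.5 obtained with half as many tosses.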

The possibility of using expected values to great advantage occurs
frequently in practice. The illustration is not artificial; in many
probabilistic problems, much of the variance or fluctuation is introduced by
a part which can be calculated analytically, while another part which is hard
to calculate analytically may, in fact, not introduce much fluctuation. In
these cases the sensible calculator combines analytic and probabilistic
methods, calculating analytically that which is easy and Monte Carloing that
which is hard.

⁷ It is probably worth while to point out that the Use of Expected
Values is quite different from using an expected-value model (see
Chapter [number illegible]). The former is exact and the latter is an
approximation.




Examples

Expected values can be used in all the problems mentioned under
Importance Sampling. In the shielding problem, for example, instead of counting
the particles that succeed in piercing the shield, we can calculate the
probability after each collision that the particle would go directly through
the shield. An unbiased and improved estimate of the penetration is obtained
by summing these probabilities.

Another application of the same idea is in the calculation of multiple
scattering effects in nuclear cross section measurements. In this
calculation one simulates a scattering experiment in which the main interest is in
the number of particles that enter a detector. This probability may be very
small, say of the order 10^(exponent illegible) or 10^(exponent illegible),
so that if one sampled in a straightforward fashion, there would be
practically no counts. It is, however, very
simple to calculate the probability that the particle will enter the detector
from any point in the system. If these probabilities are recorded, rather
than the number of particles counted, one usually finds that the percent
variance is enormously reduced, factors of 10^(exponent illegible) being
typical. The factor
is larger in this case than in the previous (shielding) example because in
this problem many samples tend to get into a region where the particle has
a relatively high (though absolutely low) probability of getting into the
detector; in the previous problem this is not true unless Importance
Sampling or Russian Roulette and Splitting is also used.

This idea can be applied to more general problems than particle
diffusion. For example, in vulnerability studies, it is often true that the
problem being studied involves several stages and that in the final stage
there is given a probability p of achieving or not achieving a kill. It
should be obvious to the reader at this point that it would be wrong to
sample from this binomial distribution; rather, the p's themselves should be
recorded and their average used for the estimate.

A similar situation occurs in the particle diffusion problems mentioned
previously. Usually there is a non-zero probability that the particles will
be absorbed after they have had a collision. This binomial process could be
sampled, but it is generally more accurate not to sample, but simply to
weight the particle with the probability of survival, multiplying all the
probabilities together after it has had all its collisions. This increases
the average length (number of collisions) of a history and therefore its
cost, but this effect is usually dwarfed by the decrease in variance.

Another way to use expected values is to integrate a sample over the
initial conditions. For example, in particle diffusion problems, it is not
much work to translate, rotate, or reflect histories (thus getting new ones)
and then average these translated, rotated, or reflected histories over their
a priori probabilities.

The three techniques discussed above can be most effective in realistic
applications. The authors are familiar with cases in which each technique
has, by itself, decreased the effective variance by factors of the order of
10^4 to 10^6. In most cases this means changing the problem from one which
cannot be done because it would be too expensive or lengthy to one which is
easily done on modern computing machines or even by hand computers. However,
these large reductions were all associated with physics or engineering
problems. In Operations Research applications the reductions, while important,
are much more modest. They tend to fall in the range 2-100. This is still
of value if the problem is at all lengthy.

The techniques that we are now going to discuss are, in general, not
as effective as the three already mentioned; however, they often are very
easy to use and may yield worthwhile improvements.




4. CORRELATION AND REGRESSION

Correlation  or  regression  is  used  when  we  are  going  to  make  comparisons 
or  calculate  differences  and  wish  to  eliminate  irrelevant  fluctuations  which 
do  not  affect  the  comparison  or  difference. 

In order to illustrate this technique, it will be necessary to change
the problem slightly. Assume, for instance, that the proprietor of one of
the gaming establishments in Las Vegas wants to change the rules in force at
his dice tables. Under current rules, if a player tosses a 2, 3, or 12 on
the first throw of dice, the player loses. If he tosses a 7 or 11, he wins,
and if he tosses a 4, 5, 6, 8, 9, or 10, he will win or lose, depending on
whether or not that number or a 7 comes up first in his subsequent throws.

Now let the rule change being considered be the interchange of the
roles of 3 and 4, and assume that, unlike most of the proprietors in Las
Vegas, the one we are considering is unsophisticated and wants to determine
by sampling what the change in his revenue will be. The obvious way to do
this is to run two sets of experiments, one with the old rules and one with
the new rules, and then compare the two experimentally determined revenues.
Under these circumstances, one is subtracting two relatively large, fluctuating
quantities to determine a small quantity. In general, this yields a
process with large percent error.

There is a better way to do this problem. Instead of running two
independent games, the proprietor could run only one game and apply both sets of
rules simultaneously to this game. In fact, he can choose to estimate the
difference in revenue directly rather than the revenue that would be achieved
under each set of rules.

This clearly amounts to playing the following game:




1. Whenever a 3 comes up, continue to toss the dice until
either a 3 or a 7 comes up. In the first happenstance,
record a minus two, since under the old rules the customer
would have lost a dollar, but under the new rules he wins
one; in the second happenstance, record a zero because
under both sets of rules the customer loses.

2. Follow a similar process if a 4 comes up.

3. If a number other than 3 or 4 comes up, terminate the play
then and there and record a zero. (Because of this rule,
the effects of chance fluctuations in the proportionate
number of times that the numbers 2, 5, 6, . . ., 11, 12
come up are eliminated from the comparison.)
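
As a rough illustration, the three rules above can be sketched in Python. The function names, the one-dollar stake, and the sign convention (house revenue under the new rules minus house revenue under the old rules, so that the "minus two" of rule 1 is -2) are assumptions made for this sketch. A second function plays each rule set as a full, independent game so that the two approaches can be compared.

```python
import random
import statistics

def roll(rng):
    """Total of two fair dice."""
    return rng.randint(1, 6) + rng.randint(1, 6)

def difference_game(rng):
    """One play of the combined game: new-rule minus old-rule house revenue."""
    first = roll(rng)
    if first not in (3, 4):
        return 0                      # rule 3: both rule sets agree, stop at once
    while True:
        total = roll(rng)
        if total == first:
            # 3 repeated: house won $1 under the old rules, loses $1 under the new.
            # 4 repeated: the reverse.
            return -2 if first == 3 else 2
        if total == 7:
            return 0                  # customer loses under both rule sets

def house_revenue(rng, losing_first_throws):
    """One full game under a single rule set: +1 if the house wins, else -1."""
    first = roll(rng)
    if first in losing_first_throws:
        return 1
    if first in (7, 11):
        return -1
    while True:                       # 'first' is now the customer's point
        total = roll(rng)
        if total == first:
            return -1
        if total == 7:
            return 1

def compare(n_games, seed=0):
    """Return per-game differences from the combined game and from independent games."""
    rng = random.Random(seed)
    correlated = [difference_game(rng) for _ in range(n_games)]
    independent = [house_revenue(rng, (2, 4, 12)) - house_revenue(rng, (2, 3, 12))
                   for _ in range(n_games)]
    return correlated, independent
```

Calling `compare(100_000)` and passing each list to `statistics.mean` and `statistics.pvariance` gives two estimates of the same expected change (1/36 of a dollar per game under the rules assumed here), with a much smaller spread for the combined game.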

The specific game that is played is quite different from the two games
that are being compared. In this case there are three sources of savings:
first, only one set of games is played; second, the number and kinds of
chance fluctuations that can affect the results are greatly reduced; and
lastly, the stop rule makes the average game shorter. It is in fact generally
true that if we wish to compare two or more situations, we can, by combining
this comparison into a single problem, reduce the work substantially. As in
the example, only one problem, rather than several, has to be done, and the
direct estimate of the difference can usually be made more accurately than
estimates of separate individual quantities.

This is a substantial virtue of the Monte Carlo method. In many
complicated problems we are not actually interested in absolute values but only
in comparisons. We may wish, for example, to know if Strategy A is better
than Strategy B, or if Engineering Design A is better than Engineering Design B.
We might, in fact, not even believe the absolute values because the
idealizations are so rough, but do believe in the existence of the qualitative
features implied by differences in the calculated performances. Monte Carlo
can then be used to estimate the thing that we actually desire to know and
that we believe, and we can thus bypass the estimate of less important
quantities. Usually, however, we can obtain these less important quantities
also, but at some extra cost.

Correlated Sampling can also often be used to test the accuracy of an
approximate theory. If the approximate theory happens to be an exact
treatment of an idealized situation, and if the idealized situation happens to be
"structurally" similar to the unidealized situation, then it is often possible
to design very efficient sampling schemes to calculate the difference between
the idealized and unidealized situations. The answer to the problem posed by
the unidealized situation can then be obtained by adding together the results
of the approximate analytic calculation and the Monte Carlo difference
calculation.
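
A minimal sketch of this idea follows, using stand-in functions chosen purely for illustration (they do not come from the text). The "unidealized" quantity is the mean of exp(U) for U uniform on (0, 1); the "idealized" theory is the linear approximation 1 + U, whose mean, 1.5, is known exactly. Monte Carlo is applied only to the difference between the two.

```python
import math
import random
import statistics

# Exact mean of the assumed idealized model 1 + U, with U uniform on (0, 1).
EXACT_IDEALIZED_MEAN = 1.5

def difference_estimate(n_samples, seed=0):
    """Return (combined estimate, variance of difference terms, variance of direct terms)."""
    rng = random.Random(seed)
    differences = []
    direct = []
    for _ in range(n_samples):
        u = rng.random()
        unidealized = math.exp(u)      # what we actually want the mean of
        idealized = 1.0 + u            # structurally similar, exactly solvable
        differences.append(unidealized - idealized)
        direct.append(unidealized)
    combined = EXACT_IDEALIZED_MEAN + statistics.mean(differences)
    return combined, statistics.pvariance(differences), statistics.pvariance(direct)
```

The first returned value estimates e - 1; comparing the two variances shows how much of the fluctuation is removed when only the difference is sampled.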

Comparing Different Bombing Strategies

If a strategic or tactical bombing campaign is studied by Monte Carlo,
one usually has to estimate the effects of the following sequence of random
events.

1. Number of aircraft that abort

2. Number of aircraft shot down by area defense on the way into the
target

3. Number of aircraft that stray through navigational errors

4. Number of aircraft shot down by local defense at the target

5. Weather conditions over target (affects reconnaissance and CEP)

6. Place where bombs land

7. Damage done

8. Number of aircraft shot down by area defense on the way out
of the target area

9. Number of aircraft that don't get back for a miscellany of
minor reasons.

In comparing different bombing strategies it is often effective to use
correlation to cut down the sample size required to get significant
information. If the correlation is done by using the same random numbers, the
computer cannot use a single list of random numbers in sequence in the two
problems, for they would soon get out of step. He can either throw away the
excess random numbers or, what is sometimes better, save them for use on
later strikes. For example, if a larger number of targets were attacked on
the first strike of one of the strategies, the extra random numbers that
were used to determine the weather on these excess targets can be saved. If
in a later strike an excess number of targets is attacked under the other
strategy, the previously saved random numbers can then be used on these
targets. Correlation can thus be achieved by using the same random numbers
whenever the two strategies give rise to the same type of contingencies,
even if they are on different strikes with different planes and targets.
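
One way to keep two strategies "in step" on a machine is to give each type of contingency its own stream of random numbers, so that the k-th weather draw is the same under both strategies no matter which strike it falls on. The sketch below assumes this arrangement; the class name, the contingency labels, and the seeding scheme are all hypothetical and are not taken from the text.

```python
import random

class ContingencyStreams:
    """One independent random-number stream per contingency type, for one trial."""

    def __init__(self, trial, kinds):
        # random.Random accepts a string seed and hashes it deterministically.
        self._streams = {kind: random.Random(f"trial {trial} / {kind}")
                         for kind in kinds}

    def draw(self, kind):
        """Next uniform random number reserved for this type of contingency."""
        return self._streams[kind].random()

def weather_draws(targets_per_strike, streams):
    """Collect the weather numbers used by a strategy, strike by strike."""
    draws = []
    for n_targets in targets_per_strike:
        streams.draw("area defense")          # another contingency in between
        draws.extend(streams.draw("weather") for _ in range(n_targets))
    return draws

KINDS = ("area defense", "weather")
# Strategy A attacks 3 targets, then 1; Strategy B attacks 1, then 3.
strategy_a = weather_draws([3, 1], ContingencyStreams(7, KINDS))
strategy_b = weather_draws([1, 3], ContingencyStreams(7, KINDS))
```

Here the two weather numbers that Strategy A spends on its extra first-strike targets are exactly the ones Strategy B uses on its extra second-strike targets, which is the saving-for-later arrangement described above.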

Because the point is sometimes misunderstood, we would like to emphasize
that when the results of the same type of contingencies are being correlated,
the contingencies do not necessarily have the same detailed character. If,
for example, in Strategy A, n_1 aircraft come up to the area defenses and in
Strategy B, n_2 aircraft are used, the problems can still be correlated by
using the same random numbers in computing the number that survive. This is
true as long as the picking is so arranged that the degrees of success are
monotone functions of the uniform random numbers, so that fluctuations in
the values of the uniform random numbers affect the two situations in the
same qualitative way.
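
The monotonicity condition can be illustrated with a small sketch. It assumes, purely for illustration, that each aircraft survives the area defenses independently with the same probability, so the number of survivors is binomial; the number is then picked from a single uniform random number by inverting the cumulative distribution, which makes it a nondecreasing function of that number for any force size. The function and parameter names are hypothetical.

```python
import random
from math import comb

def survivors(n_aircraft, p_survive, u):
    """Inverse-CDF binomial draw: nondecreasing in u for every n_aircraft."""
    cdf = 0.0
    for k in range(n_aircraft + 1):
        cdf += comb(n_aircraft, k) * p_survive**k * (1.0 - p_survive)**(n_aircraft - k)
        if u <= cdf:
            return k
    return n_aircraft   # guards against round-off when u is very close to 1

def correlated_pair(n_1, n_2, p_survive, rng):
    """Survivors under Strategies A and B, driven by the same uniform number."""
    u = rng.random()
    return survivors(n_1, p_survive, u), survivors(n_2, p_survive, u)
```

A call such as `correlated_pair(20, 30, 0.8, random.Random(0))` returns one survivor count for each strategy; a lucky (large) uniform number is lucky for both, even though the two forces differ in size.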

If the different strategies are such that a definite type of event is
all-important to the comparison, then correlation by weighting may be better
than by using the same random numbers. For example, if the effect of
different types of defensive armament is being studied, the same kill probabilities
could be used for the enemy fighters in the sampling, and weighting factors
carried along to account for the differences being studied. The correlation
may be higher if this is done, because exactly the same number of bombers is
shot down each time, so all of the subsequent history is the same. If the
correlating were done by using the same random numbers, different numbers of
aircraft would be shot down and the actual progress of the two strategic
campaigns might be quite different. It would still be possible to obtain
correlation by using the same random numbers for the same contingencies, but
it is unlikely that the correlation would be as high. Weighting will, of
course, not work well if by its use one is forced to use probability density
functions for the sampling which will themselves introduce a lot of variance
because the sampling is then not good Importance Sampling for all the cases
being considered.
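
The bookkeeping for correlation by weighting can be sketched as follows. The sketch assumes each bomber is shot down independently, samples every history with one common kill probability, and carries a likelihood-ratio weight for each armament being compared. The kill probabilities, the use of the survivor count as the "score," and all names are assumptions made for illustration only.

```python
import random

def likelihood_ratio(shot_down, survived, p_true, p_sample):
    """Weight that converts a history sampled with p_sample to one with p_true."""
    return ((p_true / p_sample) ** shot_down
            * ((1.0 - p_true) / (1.0 - p_sample)) ** survived)

def weighted_difference(n_bombers, p_sample, p_a, p_b, n_trials, seed=0):
    """Estimate (mean score under A) - (mean score under B) from one set of histories."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        shot_down = sum(rng.random() < p_sample for _ in range(n_bombers))
        survived = n_bombers - shot_down
        score = survived          # stand-in for the whole subsequent history
        w_a = likelihood_ratio(shot_down, survived, p_a, p_sample)
        w_b = likelihood_ratio(shot_down, survived, p_b, p_sample)
        total += score * (w_a - w_b)   # same history, two weights
    return total / n_trials
```

With 10 bombers and kill probabilities 0.25 (A) and 0.35 (B), the exact difference in expected survivors is 1.0, which gives a check on the weights. This toy score is too simple to show a saving; the advantage described in the text arises when a long and costly subsequent history is shared by both cases.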

Another case where weighting might be preferable to using the same
random numbers would be when two different reconnaissance devices were being
compared. The possible weather situations can then be classified according
to the following criteria:

1. Both devices work

2. One works and other does not

3. Neither works

Only situation 2 makes a difference between the two devices, so that in
the sampling only it should be allowed to occur. If 1 and 3 occur, the
sample would give zero for the estimate, so they need not be calculated;
only the fraction r of time they occur is needed. Account of this is
automatically taken by the weighting factors. If instead of weighting
factors the random numbers were used to do the correlating, then (1-r) of
the time the sample would be calculating zero and be wasted. If instead of
being an all or nothing situation the devices have different probabilities
of working as the weather changes, then the appropriate modification must
be made in the sampling. This last is as much an example of Importance
Sampling as of Correlation.




5. SYSTEMATIC SAMPLING


If we are doing a multi-stage sampling problem, it often turns out to
be very easy to do the first stage systematically. For example, in our
problem, if we are going to toss the dice one at a time, then there is
really no point in actually tossing the first die. If, for example, we
were planning on getting 600 samples, we would expect on the average that
each die would come up one about 100 times, two another 100, and so on. It
is easy to show that we do not bias the results if we assume that the first
100 tosses of the first die actually do come up one, the second 100 tosses
of this die come up two, etc., and so only toss the second die. The main
advantage in doing this is that we have eliminated the error caused by
fluctuations in the proportions of ones, twos, etc. which would result if
the first toss was random.
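
A minimal sketch of this procedure for the running problem (estimating the probability of a total of three on two dice) might look as follows; the function names are illustrative, and a plain estimate is included only for comparison.

```python
import random

def plain_estimate(n_samples, rng):
    """Toss both dice at random and count totals of three."""
    hits = sum(rng.randint(1, 6) + rng.randint(1, 6) == 3 for _ in range(n_samples))
    return hits / n_samples

def systematic_estimate(n_samples, rng):
    """Assign the first die each face equally often; toss only the second die."""
    if n_samples % 6 != 0:
        raise ValueError("n_samples must be a multiple of 6")
    per_face = n_samples // 6
    hits = 0
    for first in range(1, 7):            # first die is not tossed at all
        for _ in range(per_face):
            second = rng.randint(1, 6)   # only the second die is random
            if first + second == 3:
                hits += 1
    return hits / n_samples
```

For example, `systematic_estimate(600, random.Random(0))` uses 100 assigned values of each face for the first die, as in the text; both functions estimate 2/36.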

In practice, however, doing the first stage of the sampling
systematically does not usually lead to substantial improvements in efficiency.
Generally, in fact, it will only reduce the number of samples required by
relatively few percent (say [illegible] to [illegible]). However, it ordinarily
does not cost anything to apply this technique, so that there is no point in
not using it. Also, it is sometimes interesting thus to estimate how the
expected score depends on what happens at the first stage. And the only time
we may not be able to use it conveniently is when we do not know in advance
how big a sample we will want, and even then it may not be impractical.

The main application of Systematic Sampling is in those multi-stage
problems where it is trivial to calculate the distribution of events at the
first stage. In that case, the sampling should be done systematically.




6. STRATIFIED SAMPLING

This technique is a sort of combination of Importance Sampling and
Systematic Sampling. For example, if we were only a little bit
sophisticated and were doing the systematic sampling described above, we would soon
notice that there is no point in considering the 400 tosses in which we had
assigned the values three to six for the first toss of the die, since under
these circumstances, we can never get a total of three. Therefore, we might
systematically divide the sample into halves rather than sixths. In the
first half we would say that the first die came up one, and in the second
half that the first die came up two.
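
In a sketch of this scheme, the estimate must carry a weight equal to the probability of the strata actually sampled (here 2/6, the chance that the first die shows one or two), since the discarded strata contribute exactly zero. The function name is illustrative.

```python
import random

def stratified_estimate(n_samples, rng):
    """Estimate P(total = 3), sampling only first-die values one and two."""
    if n_samples % 2 != 0:
        raise ValueError("n_samples must be even")
    half = n_samples // 2
    hits = 0
    for first in (1, 2):                 # halves rather than sixths
        for _ in range(half):
            if first + rng.randint(1, 6) == 3:
                hits += 1
    # First-die values three to six can never give a total of three, so the
    # sampled strata are weighted by their combined probability, 2/6.
    return (2 / 6) * hits / n_samples
```

Every one of the tosses now has a chance of producing a three, so for the same sample size the estimate of 2/36 fluctuates less than when the first die takes all six values.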

In theory, this method could be as powerful as Importance Sampling.
In actual practice, the fact that you have to sample systematically turns
out to decrease sharply the number of places in which it can be used
conveniently. However, at those places in a calculation where it can be used,
it is usually better than Importance Sampling and in any case never worse.
Therefore, whenever the costs of the two techniques are comparable,
Stratified Sampling is preferable to Importance Sampling.

The last remark on the applications of Systematic Sampling also applies
to Stratified Sampling: it is usually useful when it is trivial to calculate
the distribution of events at the first stage. There is only the additional
fact that in more general problems one must have some idea of the relative
importance, where importance is defined differently from the systematic
sampling case.


7. SPECIALIZED TECHNIQUES


If one were throwing the dice one at a time in a naive fashion, it would
be a mistake to separate throws into disjoint pairs and look just at these
pairs to see if they had produced any threes. For example, one can look not
only at dice one and two and then at three and four, but also dice two and
three and also four and five.* One will then have doubled to a first
approximation the total effective number of throws for the same amount of
work. Actually this doubling is not as effective as one might first think
because there is a high correlation between successive throws and, therefore,
the fluctuation will not be decreased as much as if all the pairs had been
independent. However, this technique should be used because it is essentially
free.

For another example, assume that one happened to have a machine which
could throw, say, six dice at a time. One way to use this machine would be
to simply group the six dice into three distinct pairs which would correspond
to the naive estimate. A much better way would be to consider all possible
pairs of dice that could be obtained from the six. This is not as difficult
as might seem at first glance. One would first calculate the total number of
possible pairs that can occur (we have 6 x 5/2 = 15) and then by superficial
observation find the number of pairs which could yield a total of three [i.e.,
if there are m 2's and n 1's, there are m x n pairs that will yield 3's; the
estimate will then be m x n/15 for that sample].
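
The counting shortcut can be checked against a brute-force enumeration of all pairs. In the sketch below the function names are illustrative; a throw is represented as a list of six face values.

```python
import random
from itertools import combinations
from math import comb

def all_pairs_estimate(dice):
    """m x n / (number of pairs), with m the count of 2's and n the count of 1's."""
    ones = dice.count(1)
    twos = dice.count(2)
    return ones * twos / comb(len(dice), 2)

def brute_force_estimate(dice):
    """Same quantity, found by examining every unordered pair of dice."""
    pairs = list(combinations(dice, 2))
    return sum(a + b == 3 for a, b in pairs) / len(pairs)

def disjoint_pairs_estimate(dice):
    """The naive estimate: group the dice into consecutive, non-overlapping pairs."""
    pairs = [(dice[i], dice[i + 1]) for i in range(0, len(dice) - 1, 2)]
    return sum(a + b == 3 for a, b in pairs) / len(pairs)
```

For the throw `[1, 2, 2, 3, 4, 5]` there are one 1 and two 2's, so both the shortcut and the enumeration give 2/15, while the disjoint grouping (1,2), (2,3), (4,5) gives 1/3 from only three pairs.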

The reader will undoubtedly think of many other specialized estimates
that depend upon the fact that he is specifically calculating the probability
of getting a three on a throw of two dice and which could not be generalized
to other problems. This lack of generality is no reason for not investing
at least a modest effort in looking for such methods and using them.

* We are indebted to J. T. Robacker for suggesting this idea.


It is probably clear to the reader that the problems faced by the Monte
Carlo experimenter in trying to cut down the statistical fluctuations are in
some respects similar to those that are faced in almost any application of
sampling. Therefore, much of the literature of statistics is relevant to
the problems we have been considering. In fact, a fairly complete discussion
of the fifth and sixth techniques, and to a slightly lesser extent, the
preceding two, can be found in many statistics textbooks; only the first two do
not seem to be applicable to ordinary statistical practice and have therefore
not been discussed. For this reason it is very valuable to have professional
statistical help in designing these calculations. However, if one has to
choose between a person who is mainly interested in statistics and one who is
mainly interested in the problem itself, experience has shown that, in this
field at least, the latter is preferable. This last remark is not intended
as a slur on statisticians, but simply to amplify a comment made earlier, that
"the greatest gains in variance reduction are often made by exploiting specific
details of the problem, rather than by the routine application of general
principles."


